Unsupervised Analysis for Decipherment Problems
نویسندگان
چکیده
We study a number of natural language decipherment problems using unsupervised learning. These include letter substitution ciphers, character code conversion, phonetic decipherment, and word-based ciphers with relevance to machine translation. Straightforward unsupervised learning techniques most often fail on the first try, so we describe techniques for understanding errors and significantly increasing performance.
منابع مشابه
Unsupervised Analysis of Structured Human Artifacts
Unsupervised Analysis of Structured Human Artifacts by Taylor Berg-Kirkpatrick Doctor of Philosophy in Computer Science University of California, Berkeley Professor Dan Klein, Chair The presence of hidden structure in human data—including natural language but also sources like music, historical documents, and other complex artifacts—makes this data extremely difficult to analyze. In this thesis...
متن کاملComparison school bonding and interpersonal problems in students with unsupervised and abused families with normal
This study aimed to compare the school bonding and interpersonal problems in students with unsupervised and abused families with normal families in Bandar Lengeh. The sample consisted of 152 normal students and 81 unsupervised or abused students. Normal students were selected by the multi-stage cluster sampling method. Data were collected through two questionnaires: school bonding (Rezaei Shari...
متن کاملExploiting Machine Learning Techniques to Perform Side Channel Attack
This paper proposes a novel unsupervised learning approach for Power Analysis – a form of side channel attack in Cryptanalysis. Different from existing works that exploit supervised learning framework to solve this problem, our method does not require any labeled pairs which contains {X,Y}={key, power-trace} information, but is still capable of deciphering the secret key accurately. Besides pro...
متن کاملUNRAVEL - A Decipherment Toolkit
In this paper we present the UNRAVEL toolkit: It implements many of the recently published works on decipherment, including decipherment for deterministic ciphers like e.g. the ZODIAC-408 cipher and Part two of the BEALE ciphers, as well as decipherment of probabilistic ciphers and unsupervised training for machine translation. It also includes data and example configuration files so that the p...
متن کاملUnsupervised Consonant-Vowel Prediction over Hundreds of Languages
In this paper, we present a solution to one aspect of the decipherment task: the prediction of consonants and vowels for an unknown language and alphabet. Adopting a classical Bayesian perspective, we performs posterior inference over hundreds of languages, leveraging knowledge of known languages and alphabets to uncover general linguistic patterns of typologically coherent language clusters. W...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006